Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 56551 |
| Missing cells | 77044 |
| Missing cells (%) | 6.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 9.1 MiB |
| Average record size in memory | 168.0 B |
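The missing-cell totals above can be cross-checked against the per-variable counts listed in the alerts of this report. The following is a hand check under that assumption, not ydata-profiling's own code; the dictionary restates the reported missing counts and the variable names are illustrative.

```python
# Per-variable missing counts, restated from the "Missing" alerts in this report.
missing_by_variable = {
    "store_and_fwd_flag": 3415,
    "RatecodeID": 3415,
    "passenger_count": 3415,
    "ehail_fee": 56551,
    "payment_type": 3415,
    "trip_type": 3418,
    "congestion_surcharge": 3415,
}
n_variables = 21        # "Number of variables"
n_observations = 56551  # "Number of observations"

# Missing cells are summed over columns; the percentage is taken over all cells.
missing_cells = sum(missing_by_variable.values())
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(missing_cells)
print(f"{missing_pct:.1f}%")
```

Most of the missing cells come from ehail_fee, which is empty in every row.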
Variable types
| Numeric | 12 |
|---|---|
| Categorical | 5 |
| DateTime | 2 |
| Boolean | 1 |
| Unsupported | 1 |
store_and_fwd_flag is highly imbalanced (97.8%) | Imbalance |
improvement_surcharge is highly imbalanced (94.4%) | Imbalance |
payment_type is highly imbalanced (58.3%) | Imbalance |
trip_type is highly imbalanced (79.2%) | Imbalance |
congestion_surcharge is highly imbalanced (55.9%) | Imbalance |
store_and_fwd_flag has 3415 (6.0%) missing values | Missing |
RatecodeID has 3415 (6.0%) missing values | Missing |
passenger_count has 3415 (6.0%) missing values | Missing |
ehail_fee has 56551 (100.0%) missing values | Missing |
payment_type has 3415 (6.0%) missing values | Missing |
trip_type has 3418 (6.0%) missing values | Missing |
congestion_surcharge has 3415 (6.0%) missing values | Missing |
RatecodeID is highly skewed (γ1 = 48.08908382) | Skewed |
trip_distance is highly skewed (γ1 = 89.2234337) | Skewed |
Unnamed: 0 is uniformly distributed | Uniform |
Unnamed: 0 has unique values | Unique |
ehail_fee is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
trip_distance has 2870 (5.1%) zeros | Zeros |
extra has 32860 (58.1%) zeros | Zeros |
mta_tax has 5169 (9.1%) zeros | Zeros |
tip_amount has 22366 (39.6%) zeros | Zeros |
tolls_amount has 55023 (97.3%) zeros | Zeros |
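The imbalance percentages in the alerts appear consistent with one minus the normalized Shannon entropy of the category counts. The sketch below assumes that definition (it is inferred from the numbers, not quoted from the tool) and uses a hypothetical helper, `imbalance_score`, applied to counts taken from the Common Values tables in this report.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy of k category counts.

    0 means perfectly balanced; values near 1 mean one category dominates.
    Returning 0.0 for fewer than two categories is an assumption of this sketch.
    """
    k = len(counts)
    if k < 2:
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Non-missing counts restated from this report.
store_and_fwd_flag = [53021, 115]                  # reported as 97.8%
trip_type = [51397, 1736]                          # reported as 79.2%
improvement_surcharge = [55743, 447, 179, 179, 3]  # reported as 94.4%

for name, counts in [
    ("store_and_fwd_flag", store_and_fwd_flag),
    ("trip_type", trip_type),
    ("improvement_surcharge", improvement_surcharge),
]:
    print(name, f"{100 * imbalance_score(counts):.1f}%")
```

Missing values are left out of the counts here, which is what makes the scores line up with the reported figures.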
Reproduction
| Analysis started | 2024-04-11 04:34:22.881928 |
|---|---|
| Analysis finished | 2024-04-11 04:35:36.468681 |
| Duration | 1 minute and 13.59 seconds |
| Software version | ydata-profiling v4.7.0 |
| Download configuration | config.json |
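The reported duration follows directly from the two timestamps. A small check using only the standard library (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps restated from the Reproduction table.
started = datetime.fromisoformat("2024-04-11 04:34:22.881928")
finished = datetime.fromisoformat("2024-04-11 04:35:36.468681")

elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)
print(f"{int(minutes)} minute and {seconds:.2f} seconds")
```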
Unnamed: 0
Real number (ℝ)
UNIFORM  UNIQUE 
| Distinct | 56551 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28275 |
| Minimum | 0 |
|---|---|
| Maximum | 56550 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 441.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2827.5 |
| Q1 | 14137.5 |
| median | 28275 |
| Q3 | 42412.5 |
| 95-th percentile | 53722.5 |
| Maximum | 56550 |
| Range | 56550 |
| Interquartile range (IQR) | 28275 |
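The half-integer quantiles above are what linear interpolation between neighbouring order statistics produces. The sketch below assumes that `Unnamed: 0` holds the integers 0 through 56550 (implied by its minimum, maximum, distinct count and strict monotonicity) and uses a hypothetical `percentile` helper; it is not the profiler's implementation.

```python
def percentile(sorted_values, q):
    """Linearly interpolated quantile q (0..1) of an already sorted sequence."""
    pos = q * (len(sorted_values) - 1)  # fractional index into the sorted data
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


# Assumed column content: a plain row index 0, 1, ..., 56550.
values = list(range(56551))

p5 = percentile(values, 0.05)
q1 = percentile(values, 0.25)
median = percentile(values, 0.50)
q3 = percentile(values, 0.75)
iqr = q3 - q1
print(p5, q1, median, q3, iqr)
```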
Descriptive statistics
| Standard deviation | 16325.012 |
|---|---|
| Coefficient of variation (CV) | 0.57736558 |
| Kurtosis | -1.2 |
| Mean | 28275 |
| Median Absolute Deviation (MAD) | 14138 |
| Skewness | 0 |
| Sum | 1.5989795 × 10⁹ |
| Variance | 2.6650601 × 10⁸ |
| Monotonicity | Strictly increasing |
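These descriptive statistics can be reproduced by hand if `Unnamed: 0` is simply the row index 0 through 56550, which the table implies but does not state outright. The sketch below works under that assumption and uses the sample (n − 1) variance, which is the convention the reported value matches.

```python
import math

n = 56551          # number of observations in this report
values = range(n)  # assumed content of "Unnamed: 0": 0, 1, ..., n - 1

total = sum(values)
mean = total / n
# Sample variance: divide the sum of squared deviations by n - 1.
variance = sum((v - mean) ** 2 for v in values) / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

print(total, mean)
print(f"{variance:.7g} {std:.3f} {cv:.8f}")
```

For a consecutive integer index the sample variance also has the closed form n(n + 1)/12, so none of these figures carries information about the trips themselves.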
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 37705 | 1 | < 0.1% |
| 37694 | 1 | < 0.1% |
| 37695 | 1 | < 0.1% |
| 37696 | 1 | < 0.1% |
| 37697 | 1 | < 0.1% |
| 37698 | 1 | < 0.1% |
| 37699 | 1 | < 0.1% |
| 37700 | 1 | < 0.1% |
| 37701 | 1 | < 0.1% |
| Other values (56541) | 56541 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 56550 | 1 | |
| 56549 | 1 | |
| 56548 | 1 | |
| 56547 | 1 | |
| 56546 | 1 | |
| 56545 | 1 | |
| 56544 | 1 | |
| 56543 | 1 | |
| 56542 | 1 | |
| 56541 | 1 |
VendorID
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 441.9 KiB |
| 2 | 49213 |
|---|---|
| 1 | 7338 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 56551 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 49213 | 87.0% |
| 1 | 7338 | 13.0% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 49213 | |
| 1 | 7338 | 13.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 49213 | |
| 1 | 7338 | 13.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 56551 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 49213 | |
| 1 | 7338 | 13.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 56551 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 49213 | |
| 1 | 7338 | 13.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56551 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 49213 | |
| 1 | 7338 | 13.0% |
lpep_pickup_datetime
DateTime
| Distinct | 55284 |
|---|---|
| Distinct (%) | 97.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 441.9 KiB |
| Minimum | 2023-12-31 14:38:47 |
|---|---|
| Maximum | 2024-01-31 23:57:29 |
lpep_dropoff_datetime
DateTime
| Distinct | 55300 |
|---|---|
| Distinct (%) | 97.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 441.9 KiB |
| Minimum | 2023-12-31 14:46:45 |
|---|---|
| Maximum | 2024-02-01 19:17:30 |
store_and_fwd_flag
Boolean
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3415 |
| Missing (%) | 6.0% |
| Memory size | 110.6 KiB |
| False | 53021 |
|---|---|
| True | 115 |
| (Missing) | 3415 |
| Value | Count | Frequency (%) |
| False | 53021 | 93.8% |
| True | 115 | 0.2% |
| (Missing) | 3415 | 6.0% |
RatecodeID
Real number (ℝ)
MISSING  SKEWED 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3415 |
| Missing (%) | 6.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.151611 |
| Minimum | 1 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 441.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.045251 |
|---|---|
| Coefficient of variation (CV) | 0.90764243 |
| Kurtosis | 4339.8336 |
| Mean | 1.151611 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 48.089084 |
| Sum | 61192 |
| Variance | 1.0925496 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 51077 | |
| 5 | 1867 | 3.3% |
| 2 | 127 | 0.2% |
| 4 | 43 | 0.1% |
| 3 | 19 | < 0.1% |
| 99 | 3 | < 0.1% |
| (Missing) | 3415 | 6.0% |
| Value | Count | Frequency (%) |
| 1 | 51077 | |
| 2 | 127 | 0.2% |
| 3 | 19 | < 0.1% |
| 4 | 43 | 0.1% |
| 5 | 1867 | 3.3% |
| 99 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 99 | 3 | < 0.1% |
| 5 | 1867 | 3.3% |
| 4 | 43 | 0.1% |
| 3 | 19 | < 0.1% |
| 2 | 127 | 0.2% |
| 1 | 51077 |
PULocationID
Real number (ℝ)
| Distinct | 211 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 96.077594 |
| Minimum | 1 |
|---|---|
| Maximum | 265 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 441.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 33 |
| Q1 | 74 |
| median | 75 |
| Q3 | 112 |
| 95-th percentile | 244 |
| Maximum | 265 |
| Range | 264 |
| Interquartile range (IQR) | 38 |
Descriptive statistics
| Standard deviation | 57.862401 |
|---|---|
| Coefficient of variation (CV) | 0.60224657 |
| Kurtosis | 1.236271 |
| Mean | 96.077594 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | 1.3369984 |
| Sum | 5433284 |
| Variance | 3348.0575 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 74 | 12141 | |
| 75 | 8458 | |
| 95 | 2915 | 5.2% |
| 43 | 2765 | 4.9% |
| 166 | 2649 | 4.7% |
| 82 | 2623 | 4.6% |
| 41 | 2589 | 4.6% |
| 97 | 1898 | 3.4% |
| 65 | 1624 | 2.9% |
| 7 | 1521 | 2.7% |
| Other values (201) | 17368 |
| Value | Count | Frequency (%) |
| 1 | 3 | < 0.1% |
| 3 | 5 | < 0.1% |
| 7 | 1521 | |
| 9 | 5 | < 0.1% |
| 10 | 22 | < 0.1% |
| 11 | 6 | < 0.1% |
| 14 | 25 | < 0.1% |
| 15 | 6 | < 0.1% |
| 16 | 9 | < 0.1% |
| 17 | 85 | 0.2% |
| Value | Count | Frequency (%) |
| 265 | 29 | 0.1% |
| 264 | 112 | 0.2% |
| 263 | 52 | 0.1% |
| 262 | 3 | < 0.1% |
| 260 | 981 | |
| 259 | 11 | < 0.1% |
| 258 | 12 | < 0.1% |
| 257 | 2 | < 0.1% |
| 256 | 92 | 0.2% |
| 255 | 190 | 0.3% |
DOLocationID
Real number (ℝ)
| Distinct | 241 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 140.49985 |
| Minimum | 1 |
|---|---|
| Maximum | 265 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 441.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 33 |
| Q1 | 74 |
| median | 140 |
| Q3 | 225 |
| 95-th percentile | 260 |
| Maximum | 265 |
| Range | 264 |
| Interquartile range (IQR) | 151 |
Descriptive statistics
| Standard deviation | 76.556276 |
|---|---|
| Coefficient of variation (CV) | 0.54488511 |
| Kurtosis | -1.2851439 |
| Mean | 140.49985 |
| Median Absolute Deviation (MAD) | 66 |
| Skewness | 0.09043076 |
| Sum | 7945407 |
| Variance | 5860.8634 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 75 | 3078 | 5.4% |
| 74 | 2890 | 5.1% |
| 236 | 2725 | 4.8% |
| 238 | 2208 | 3.9% |
| 41 | 1995 | 3.5% |
| 166 | 1819 | 3.2% |
| 42 | 1753 | 3.1% |
| 263 | 1383 | 2.4% |
| 95 | 1352 | 2.4% |
| 239 | 1295 | 2.3% |
| Other values (231) | 36053 |
| Value | Count | Frequency (%) |
| 1 | 24 | < 0.1% |
| 3 | 14 | < 0.1% |
| 4 | 58 | 0.1% |
| 7 | 799 | |
| 8 | 6 | < 0.1% |
| 9 | 18 | < 0.1% |
| 10 | 131 | 0.2% |
| 11 | 5 | < 0.1% |
| 12 | 1 | < 0.1% |
| 13 | 31 | 0.1% |
| Value | Count | Frequency (%) |
| 265 | 180 | 0.3% |
| 264 | 418 | 0.7% |
| 263 | 1383 | |
| 262 | 804 | |
| 261 | 39 | 0.1% |
| 260 | 534 | 0.9% |
| 259 | 13 | < 0.1% |
| 258 | 108 | 0.2% |
| 257 | 39 | 0.1% |
| 256 | 122 | 0.2% |
passenger_count
Real number (ℝ)
MISSING 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3415 |
| Missing (%) | 6.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3091689 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 512 |
| Zeros (%) | 0.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 441.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.97825201 |
|---|---|
| Coefficient of variation (CV) | 0.74723131 |
| Kurtosis | 12.20348 |
| Mean | 1.3091689 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.5193124 |
| Sum | 69564 |
| Variance | 0.956977 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 44779 | |
| 2 | 4656 | 8.2% |
| 5 | 1496 | 2.6% |
| 6 | 902 | 1.6% |
| 3 | 596 | 1.1% |
| 0 | 512 | 0.9% |
| 4 | 192 | 0.3% |
| 8 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
| (Missing) | 3415 | 6.0% |
| Value | Count | Frequency (%) |
| 0 | 512 | 0.9% |
| 1 | 44779 | |
| 2 | 4656 | 8.2% |
| 3 | 596 | 1.1% |
| 4 | 192 | 0.3% |
| 5 | 1496 | 2.6% |
| 6 | 902 | 1.6% |
| 8 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 1 | < 0.1% |
| 8 | 2 | < 0.1% |
| 6 | 902 | 1.6% |
| 5 | 1496 | 2.6% |
| 4 | 192 | 0.3% |
| 3 | 596 | 1.1% |
| 2 | 4656 | 8.2% |
| 1 | 44779 | |
| 0 | 512 | 0.9% |
trip_distance
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 1890 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31.491124 |
| Minimum | 0 |
|---|---|
| Maximum | 201421.68 |
| Zeros | 2870 |
| Zeros (%) | 5.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 441.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1.1 |
| median | 1.79 |
| Q3 | 3.08 |
| 95-th percentile | 7.785 |
| Maximum | 201421.68 |
| Range | 201421.68 |
| Interquartile range (IQR) | 1.98 |
Descriptive statistics
| Standard deviation | 1417.4604 |
|---|---|
| Coefficient of variation (CV) | 45.011426 |
| Kurtosis | 10445.241 |
| Mean | 31.491124 |
| Median Absolute Deviation (MAD) | 0.87 |
| Skewness | 89.223434 |
| Sum | 1780854.5 |
| Variance | 2009193.9 |
| Monotonicity | Not monotonic |
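The derived figures in this table follow from the reported mean, standard deviation and quartiles. The sketch below recomputes them and adds a Tukey upper fence (Q3 + 1.5 × IQR), which is a common rule of thumb brought in here for illustration and not something the report itself computes.

```python
# Values restated from the trip_distance tables in this report.
mean, std = 31.491124, 1417.4604
q1, q3 = 1.1, 3.08
minimum, maximum = 0.0, 201421.68

cv = std / mean                    # coefficient of variation
iqr = q3 - q1                      # interquartile range
value_range = maximum - minimum

# Illustrative outlier threshold (Tukey's rule); not part of the report.
upper_fence = q3 + 1.5 * iqr

print(f"CV={cv:.6f} IQR={iqr:.2f} range={value_range:.2f} fence={upper_fence:.2f}")
print("mean above fence:", mean > upper_fence)
```

A mean well above both the fence and the 95-th percentile (7.785) suggests that the handful of very large distances in the maximum-values table dominate the mean and variance.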
| Value | Count | Frequency (%) |
| 0 | 2870 | 5.1% |
| 1.4 | 575 | 1.0% |
| 1.3 | 520 | 0.9% |
| 1.2 | 477 | 0.8% |
| 1.1 | 454 | 0.8% |
| 1.5 | 454 | 0.8% |
| 1 | 406 | 0.7% |
| 0.9 | 398 | 0.7% |
| 1.6 | 385 | 0.7% |
| 1.8 | 338 | 0.6% |
| Other values (1880) | 49674 |
| Value | Count | Frequency (%) |
| 0 | 2870 | |
| 0.01 | 95 | 0.2% |
| 0.02 | 83 | 0.1% |
| 0.03 | 55 | 0.1% |
| 0.04 | 36 | 0.1% |
| 0.05 | 42 | 0.1% |
| 0.06 | 48 | 0.1% |
| 0.07 | 42 | 0.1% |
| 0.08 | 35 | 0.1% |
| 0.09 | 33 | 0.1% |
| Value | Count | Frequency (%) |
| 201421.68 | 1 | |
| 154650.47 | 1 | |
| 103153.6 | 1 | |
| 50508.27 | 1 | |
| 49452.14 | 1 | |
| 46383.82 | 1 | |
| 45719.16 | 1 | |
| 45369.11 | 1 | |
| 43267.14 | 1 | |
| 43134.12 | 1 |
fare_amount
Real number (ℝ)
| Distinct | 2225 |
|---|---|
| Distinct (%) | 3.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.929275 |
| Minimum | -70 |
|---|---|
| Maximum | 1422.6 |
| Zeros | 52 |
| Zeros (%) | 0.1% |
| Negative | 182 |
| Negative (%) | 0.3% |
| Memory size | 441.9 KiB |
Quantile statistics
| Minimum | -70 |
|---|---|
| 5-th percentile | 5.8 |
| Q1 | 9.3 |
| median | 13.5 |
| Q3 | 19.8 |
| 95-th percentile | 40 |
| Maximum | 1422.6 |
| Range | 1492.6 |
| Interquartile range (IQR) | 10.5 |
Descriptive statistics
| Standard deviation | 15.356032 |
|---|---|
| Coefficient of variation (CV) | 0.90706964 |
| Kurtosis | 1334.9067 |
| Mean | 16.929275 |
| Median Absolute Deviation (MAD) | 4.9 |
| Skewness | 19.107181 |
| Sum | 957367.44 |
| Variance | 235.8077 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 2795 | 4.9% |
| 9.3 | 2734 | 4.8% |
| 8.6 | 2703 | 4.8% |
| 10.7 | 2452 | 4.3% |
| 7.9 | 2427 | 4.3% |
| 11.4 | 2320 | 4.1% |
| 12.1 | 2228 | 3.9% |
| 7.2 | 2112 | 3.7% |
| 12.8 | 2067 | 3.7% |
| 13.5 | 1921 | 3.4% |
| Other values (2215) | 32792 |
| Value | Count | Frequency (%) |
| -70 | 6 | |
| -47 | 1 | < 0.1% |
| -40.5 | 1 | < 0.1% |
| -40 | 2 | < 0.1% |
| -35 | 1 | < 0.1% |
| -34 | 1 | < 0.1% |
| -32.9 | 1 | < 0.1% |
| -26.66 | 1 | < 0.1% |
| -22 | 1 | < 0.1% |
| -20 | 3 |
| Value | Count | Frequency (%) |
| 1422.6 | 1 | < 0.1% |
| 445.4 | 1 | < 0.1% |
| 435.6 | 1 | < 0.1% |
| 400 | 7 | |
| 309.6 | 1 | < 0.1% |
| 299 | 1 | < 0.1% |
| 277.4 | 1 | < 0.1% |
| 272.5 | 1 | < 0.1% |
| 266.2 | 1 | < 0.1% |
| 265.5 | 1 | < 0.1% |
extra
Real number (ℝ)
ZEROS 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.90094693 |
| Minimum | -5 |
|---|---|
| Maximum | 10.25 |
| Zeros | 32860 |
| Zeros (%) | 58.1% |
| Negative | 80 |
| Negative (%) | 0.1% |
| Memory size | 441.9 KiB |
Quantile statistics
| Minimum | -5 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 2.5 |
| 95-th percentile | 2.75 |
| Maximum | 10.25 |
| Range | 15.25 |
| Interquartile range (IQR) | 2.5 |
Descriptive statistics
| Standard deviation | 1.3443133 |
|---|---|
| Coefficient of variation (CV) | 1.4921115 |
| Kurtosis | 3.7822981 |
| Mean | 0.90094693 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.744451 |
| Sum | 50949.45 |
| Variance | 1.8071782 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 32860 | |
| 2.5 | 11386 | 20.1% |
| 1 | 9243 | 16.3% |
| 2.75 | 991 | 1.8% |
| 5 | 763 | 1.3% |
| 5.25 | 600 | 1.1% |
| 7.5 | 341 | 0.6% |
| 3.75 | 198 | 0.4% |
| 6 | 57 | 0.1% |
| -1 | 39 | 0.1% |
| Other values (7) | 73 | 0.1% |
| Value | Count | Frequency (%) |
| -5 | 2 | < 0.1% |
| -2.5 | 39 | 0.1% |
| -1 | 39 | 0.1% |
| 0 | 32860 | |
| 0.5 | 20 | < 0.1% |
| 0.7 | 1 | < 0.1% |
| 1 | 9243 | 16.3% |
| 2.5 | 11386 | 20.1% |
| 2.75 | 991 | 1.8% |
| 3.25 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 10.25 | 1 | < 0.1% |
| 7.5 | 341 | 0.6% |
| 6 | 57 | 0.1% |
| 5.5 | 1 | < 0.1% |
| 5.25 | 600 | 1.1% |
| 5 | 763 | 1.3% |
| 3.75 | 198 | 0.4% |
| 3.25 | 9 | < 0.1% |
| 2.75 | 991 | 1.8% |
| 2.5 | 11386 |
mta_tax
Real number (ℝ)
ZEROS 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.57669626 |
| Minimum | -0.5 |
|---|---|
| Maximum | 4.25 |
| Zeros | 5169 |
| Zeros (%) | 9.1% |
| Negative | 162 |
| Negative (%) | 0.3% |
| Memory size | 441.9 KiB |
Quantile statistics
| Minimum | -0.5 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.5 |
| median | 0.5 |
| Q3 | 0.5 |
| 95-th percentile | 1.5 |
| Maximum | 4.25 |
| Range | 4.75 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3819979 |
|---|---|
| Coefficient of variation (CV) | 0.66239012 |
| Kurtosis | 2.67179 |
| Mean | 0.57669626 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.4558102 |
| Sum | 32612.75 |
| Variance | 0.1459224 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.5 | 44140 | |
| 1.5 | 7055 | 12.5% |
| 0 | 5169 | 9.1% |
| -0.5 | 162 | 0.3% |
| 1 | 20 | < 0.1% |
| 4.25 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| -0.5 | 162 | 0.3% |
| 0 | 5169 | 9.1% |
| 0.5 | 44140 | |
| 1 | 20 | < 0.1% |
| 1.5 | 7055 | 12.5% |
| 4.25 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 4.25 | 5 | < 0.1% |
| 1.5 | 7055 | 12.5% |
| 1 | 20 | < 0.1% |
| 0.5 | 44140 | |
| 0 | 5169 | 9.1% |
| -0.5 | 162 | 0.3% |
tip_amount
Real number (ℝ)
ZEROS 
| Distinct | 1384 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.2565101 |
| Minimum | -1.66 |
|---|---|
| Maximum | 110 |
| Zeros | 22366 |
| Zeros (%) | 39.6% |
| Negative | 9 |
| Negative (%) | < 0.1% |
| Memory size | 441.9 KiB |
Quantile statistics
| Minimum | -1.66 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 3.5 |
| 95-th percentile | 6.92 |
| Maximum | 110 |
| Range | 111.66 |
| Interquartile range (IQR) | 3.5 |
Descriptive statistics
| Standard deviation | 2.8479567 |
|---|---|
| Coefficient of variation (CV) | 1.2621068 |
| Kurtosis | 83.537732 |
| Mean | 2.2565101 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 4.6230429 |
| Sum | 127607.9 |
| Variance | 8.1108572 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 22366 | |
| 2 | 2221 | 3.9% |
| 1 | 1722 | 3.0% |
| 3 | 1171 | 2.1% |
| 5 | 646 | 1.1% |
| 4 | 459 | 0.8% |
| 2.3 | 434 | 0.8% |
| 2.16 | 379 | 0.7% |
| 1.5 | 372 | 0.7% |
| 2.5 | 357 | 0.6% |
| Other values (1374) | 26424 |
| Value | Count | Frequency (%) |
| -1.66 | 1 | < 0.1% |
| -1.46 | 1 | < 0.1% |
| -0.9 | 1 | < 0.1% |
| -0.8 | 1 | < 0.1% |
| -0.01 | 5 | < 0.1% |
| 0 | 22366 | |
| 0.01 | 117 | 0.2% |
| 0.02 | 40 | 0.1% |
| 0.03 | 16 | < 0.1% |
| 0.04 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 110 | 1 | < 0.1% |
| 88 | 1 | < 0.1% |
| 70.5 | 1 | < 0.1% |
| 70 | 1 | < 0.1% |
| 56 | 1 | < 0.1% |
| 53 | 1 | < 0.1% |
| 50 | 3 | |
| 47 | 1 | < 0.1% |
| 45.61 | 1 | < 0.1% |
| 44 | 1 | < 0.1% |
tolls_amount
Real number (ℝ)
ZEROS 
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.19120175 |
| Minimum | 0 |
|---|---|
| Maximum | 24.05 |
| Zeros | 55023 |
| Zeros (%) | 97.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 441.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 24.05 |
| Range | 24.05 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.1907482 |
|---|---|
| Coefficient of variation (CV) | 6.2277055 |
| Kurtosis | 57.45046 |
| Mean | 0.19120175 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.9219463 |
| Sum | 10812.65 |
| Variance | 1.4178812 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 55023 | |
| 6.94 | 1378 | 2.4% |
| 3.18 | 68 | 0.1% |
| 5.2 | 18 | < 0.1% |
| 13.88 | 18 | < 0.1% |
| 13.38 | 10 | < 0.1% |
| 14.75 | 9 | < 0.1% |
| 15.38 | 5 | < 0.1% |
| 20.82 | 4 | < 0.1% |
| 20.32 | 4 | < 0.1% |
| Other values (8) | 14 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 55023 | |
| 2.75 | 2 | < 0.1% |
| 3.18 | 68 | 0.1% |
| 5.2 | 18 | < 0.1% |
| 6.55 | 1 | < 0.1% |
| 6.94 | 1378 | 2.4% |
| 10.12 | 1 | < 0.1% |
| 12.14 | 2 | < 0.1% |
| 12.75 | 4 | < 0.1% |
| 13.38 | 10 | < 0.1% |
| Value | Count | Frequency (%) |
| 24.05 | 1 | < 0.1% |
| 22.32 | 2 | < 0.1% |
| 20.82 | 4 | < 0.1% |
| 20.32 | 4 | < 0.1% |
| 15.5 | 1 | < 0.1% |
| 15.38 | 5 | < 0.1% |
| 14.75 | 9 | |
| 13.88 | 18 | |
| 13.38 | 10 | |
| 12.75 | 4 | < 0.1% |
ehail_fee
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 56551 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 441.9 KiB |
improvement_surcharge
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 441.9 KiB |
| 1.0 | 55743 |
|---|---|
| 0.3 | 447 |
| 0.0 | 179 |
| -1.0 | 179 |
| -0.3 | 3 |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.0032183 |
| Min length | 3 |
Characters and Unicode
| Total characters | 169835 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 3 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 55743 | 98.6% |
| 0.3 | 447 | 0.8% |
| 0.0 | 179 | 0.3% |
| -1.0 | 179 | 0.3% |
| -0.3 | 3 | < 0.1% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 55922 | |
| 0.3 | 450 | 0.8% |
| 0.0 | 179 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 56730 | |
| . | 56551 | |
| 1 | 55922 | |
| 3 | 450 | 0.3% |
| - | 182 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 113102 | |
| Other Punctuation | 56551 | |
| Dash Punctuation | 182 | 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 56730 | |
| 1 | 55922 | |
| 3 | 450 | 0.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 56551 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 182 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 169835 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 56730 | |
| . | 56551 | |
| 1 | 55922 | |
| 3 | 450 | 0.3% |
| - | 182 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 169835 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 56730 | |
| . | 56551 | |
| 1 | 55922 | |
| 3 | 450 | 0.3% |
| - | 182 | 0.1% |
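Because improvement_surcharge was profiled as text, the character tables above are just the value counts expanded character by character. The sketch below rebuilds them from the Common Values counts; `value_counts` restates the report and the rest is illustrative.

```python
from collections import Counter

# String values and their counts, restated from the Common Values table.
value_counts = {"1.0": 55743, "0.3": 447, "0.0": 179, "-1.0": 179, "-0.3": 3}

# Each occurrence of a value contributes every one of its characters once.
char_counts = Counter()
for text, count in value_counts.items():
    for ch in text:
        char_counts[ch] += count

total_chars = sum(char_counts.values())
n_values = sum(value_counts.values())
mean_length = total_chars / n_values

print(char_counts.most_common())
print(total_chars, f"{mean_length:.7f}")
```

The same expansion explains why the digit 1 has count 55922: it appears in both "1.0" and "-1.0".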
total_amount
Real number (ℝ)
| Distinct | 4530 |
|---|---|
| Distinct (%) | 8.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.403186 |
| Minimum | -76.5 |
|---|---|
| Maximum | 1424.1 |
| Zeros | 41 |
| Zeros (%) | 0.1% |
| Negative | 185 |
| Negative (%) | 0.3% |
| Memory size | 441.9 KiB |
Quantile statistics
| Minimum | -76.5 |
|---|---|
| 5-th percentile | 8.7 |
| Q1 | 13.44 |
| median | 18.42 |
| Q3 | 26.6 |
| 95-th percentile | 49.25 |
| Maximum | 1424.1 |
| Range | 1500.6 |
| Interquartile range (IQR) | 13.16 |
Descriptive statistics
| Standard deviation | 16.956518 |
|---|---|
| Coefficient of variation (CV) | 0.75687973 |
| Kurtosis | 888.36618 |
| Mean | 22.403186 |
| Median Absolute Deviation (MAD) | 5.94 |
| Skewness | 14.485147 |
| Sum | 1266922.6 |
| Variance | 287.52349 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16 | 620 | 1.1% |
| 10.8 | 601 | 1.1% |
| 15 | 538 | 1.0% |
| 10.1 | 522 | 0.9% |
| 12.9 | 517 | 0.9% |
| 11.5 | 493 | 0.9% |
| 8.7 | 472 | 0.8% |
| 9.4 | 460 | 0.8% |
| 12.2 | 450 | 0.8% |
| 13.8 | 433 | 0.8% |
| Other values (4520) | 51445 |
| Value | Count | Frequency (%) |
| -76.5 | 1 | < 0.1% |
| -71.5 | 4 | |
| -71 | 1 | < 0.1% |
| -48 | 1 | < 0.1% |
| -46 | 1 | < 0.1% |
| -41.5 | 1 | < 0.1% |
| -41 | 1 | < 0.1% |
| -36 | 1 | < 0.1% |
| -35 | 1 | < 0.1% |
| -31.9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1424.1 | 1 | < 0.1% |
| 446.9 | 1 | < 0.1% |
| 437.1 | 1 | < 0.1% |
| 401.5 | 1 | < 0.1% |
| 401 | 6 | |
| 311.1 | 1 | < 0.1% |
| 300 | 1 | < 0.1% |
| 278.9 | 1 | < 0.1% |
| 276.5 | 1 | < 0.1% |
| 271.05 | 1 | < 0.1% |
payment_type
Categorical
IMBALANCE  MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3415 |
| Missing (%) | 6.0% |
| Memory size | 441.9 KiB |
| 1.0 | 36660 |
|---|---|
| 2.0 | 15913 |
| 3.0 | 434 |
| 4.0 | 128 |
| 5.0 | 1 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 159408 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 2.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 36660 | 64.8% |
| 2.0 | 15913 | 28.1% |
| 3.0 | 434 | 0.8% |
| 4.0 | 128 | 0.2% |
| 5.0 | 1 | < 0.1% |
| (Missing) | 3415 | 6.0% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 36660 | |
| 2.0 | 15913 | |
| 3.0 | 434 | 0.8% |
| 4.0 | 128 | 0.2% |
| 5.0 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 53136 | |
| 0 | 53136 | |
| 1 | 36660 | |
| 2 | 15913 | 10.0% |
| 3 | 434 | 0.3% |
| 4 | 128 | 0.1% |
| 5 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 106272 | |
| Other Punctuation | 53136 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 53136 | |
| 1 | 36660 | |
| 2 | 15913 | 15.0% |
| 3 | 434 | 0.4% |
| 4 | 128 | 0.1% |
| 5 | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 53136 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 159408 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 53136 | |
| 0 | 53136 | |
| 1 | 36660 | |
| 2 | 15913 | 10.0% |
| 3 | 434 | 0.3% |
| 4 | 128 | 0.1% |
| 5 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 159408 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 53136 | |
| 0 | 53136 | |
| 1 | 36660 | |
| 2 | 15913 | 10.0% |
| 3 | 434 | 0.3% |
| 4 | 128 | 0.1% |
| 5 | 1 | < 0.1% |
trip_type
Categorical
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3418 |
| Missing (%) | 6.0% |
| Memory size | 441.9 KiB |
| 1.0 | 51397 |
|---|---|
| 2.0 | 1736 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 159399 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 51397 | 90.9% |
| 2.0 | 1736 | 3.1% |
| (Missing) | 3418 | 6.0% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 51397 | |
| 2.0 | 1736 | 3.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 53133 | |
| 0 | 53133 | |
| 1 | 51397 | |
| 2 | 1736 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 106266 | |
| Other Punctuation | 53133 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 53133 | |
| 1 | 51397 | |
| 2 | 1736 | 1.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 53133 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 159399 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 53133 | |
| 0 | 53133 | |
| 1 | 51397 | |
| 2 | 1736 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 159399 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 53133 | |
| 0 | 53133 | |
| 1 | 51397 | |
| 2 | 1736 | 1.1% |
congestion_surcharge
Categorical
IMBALANCE  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3415 |
| Missing (%) | 6.0% |
| Memory size | 441.9 KiB |
| 0.0 | 38099 |
|---|---|
| 2.75 | 14890 |
| 2.5 | 143 |
| -2.75 | 4 |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 3.2803749 |
| Min length | 3 |
Characters and Unicode
| Total characters | 174306 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 3 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.75 |
|---|---|
| 2nd row | 2.75 |
| 3rd row | 2.75 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 38099 | 67.4% |
| 2.75 | 14890 | 26.3% |
| 2.5 | 143 | 0.3% |
| -2.75 | 4 | < 0.1% |
| (Missing) | 3415 | 6.0% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 38099 | |
| 2.75 | 14894 | 28.0% |
| 2.5 | 143 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 76198 | |
| . | 53136 | |
| 2 | 15037 | 8.6% |
| 5 | 15037 | 8.6% |
| 7 | 14894 | 8.5% |
| - | 4 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 121166 | |
| Other Punctuation | 53136 | |
| Dash Punctuation | 4 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 76198 | |
| 2 | 15037 | 12.4% |
| 5 | 15037 | 12.4% |
| 7 | 14894 | 12.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 53136 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 174306 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 76198 | |
| . | 53136 | |
| 2 | 15037 | 8.6% |
| 5 | 15037 | 8.6% |
| 7 | 14894 | 8.5% |
| - | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 174306 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 76198 | |
| . | 53136 | |
| 2 | 15037 | 8.6% |
| 5 | 15037 | 8.6% |
| 7 | 14894 | 8.5% |
| - | 4 | < 0.1% |
| | Unnamed: 0 | VendorID | lpep_pickup_datetime | lpep_dropoff_datetime | store_and_fwd_flag | RatecodeID | PULocationID | DOLocationID | passenger_count | trip_distance | fare_amount | extra | mta_tax | tip_amount | tolls_amount | ehail_fee | improvement_surcharge | total_amount | payment_type | trip_type | congestion_surcharge |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 2 | 2024-01-01 00:46:55 | 2024-01-01 00:58:25 | N | 1.0 | 236 | 239 | 1.0 | 1.98 | 12.8 | 1.00 | 0.5 | 3.61 | 0.0 | NaN | 1.0 | 21.66 | 1.0 | 1.0 | 2.75 |
| 1 | 1 | 2 | 2024-01-01 00:31:42 | 2024-01-01 00:52:34 | N | 1.0 | 65 | 170 | 5.0 | 6.54 | 30.3 | 1.00 | 0.5 | 7.11 | 0.0 | NaN | 1.0 | 42.66 | 1.0 | 1.0 | 2.75 |
| 2 | 2 | 2 | 2024-01-01 00:30:21 | 2024-01-01 00:49:23 | N | 1.0 | 74 | 262 | 1.0 | 3.08 | 19.8 | 1.00 | 0.5 | 3.00 | 0.0 | NaN | 1.0 | 28.05 | 1.0 | 1.0 | 2.75 |
| 3 | 3 | 1 | 2024-01-01 00:30:20 | 2024-01-01 00:42:12 | N | 1.0 | 74 | 116 | 1.0 | 2.40 | 14.2 | 1.00 | 1.5 | 0.00 | 0.0 | NaN | 1.0 | 16.70 | 2.0 | 1.0 | 0.00 |
| 4 | 4 | 2 | 2024-01-01 00:32:38 | 2024-01-01 00:43:37 | N | 1.0 | 74 | 243 | 1.0 | 5.14 | 22.6 | 1.00 | 0.5 | 6.28 | 0.0 | NaN | 1.0 | 31.38 | 1.0 | 1.0 | 0.00 |
| 5 | 5 | 1 | 2024-01-01 00:43:41 | 2024-01-01 01:00:23 | N | 1.0 | 33 | 209 | 1.0 | 2.00 | 17.0 | 3.75 | 1.5 | 2.00 | 0.0 | NaN | 1.0 | 24.25 | 1.0 | 1.0 | 2.75 |
| 6 | 6 | 1 | 2024-01-01 00:31:56 | 2024-01-01 00:48:09 | N | 1.0 | 74 | 238 | 2.0 | 3.20 | 18.4 | 3.75 | 1.5 | 4.70 | 0.0 | NaN | 1.0 | 28.35 | 1.0 | 1.0 | 2.75 |
| 7 | 7 | 2 | 2024-01-01 00:46:12 | 2024-01-01 00:57:39 | N | 1.0 | 166 | 239 | 2.0 | 2.01 | 13.5 | 1.00 | 0.5 | 5.62 | 0.0 | NaN | 1.0 | 24.37 | 1.0 | 1.0 | 2.75 |
| 8 | 8 | 2 | 2024-01-01 00:38:07 | 2024-01-01 00:39:23 | N | 1.0 | 226 | 226 | 1.0 | 0.31 | 3.7 | 1.00 | 0.5 | 0.00 | 0.0 | NaN | 1.0 | 6.20 | 2.0 | 1.0 | 0.00 |
| 9 | 9 | 2 | 2024-01-01 00:44:24 | 2024-01-01 00:57:47 | N | 1.0 | 7 | 129 | 1.0 | 2.32 | 14.9 | 1.00 | 0.5 | 3.48 | 0.0 | NaN | 1.0 | 20.88 | 1.0 | 1.0 | 0.00 |
| | Unnamed: 0 | VendorID | lpep_pickup_datetime | lpep_dropoff_datetime | store_and_fwd_flag | RatecodeID | PULocationID | DOLocationID | passenger_count | trip_distance | fare_amount | extra | mta_tax | tip_amount | tolls_amount | ehail_fee | improvement_surcharge | total_amount | payment_type | trip_type | congestion_surcharge |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 56541 | 56541 | 2 | 2024-01-31 18:32:52 | 2024-01-31 18:43:13 | NaN | NaN | 75 | 42 | NaN | 1.93 | 15.00 | 0.0 | 0.0 | 0.00 | 0.0 | NaN | 1.0 | 16.00 | NaN | NaN | NaN |
| 56542 | 56542 | 2 | 2024-01-31 18:19:00 | 2024-01-31 18:35:00 | NaN | NaN | 188 | 61 | NaN | 1.60 | 12.61 | 0.0 | 0.0 | 2.72 | 0.0 | NaN | 1.0 | 16.33 | NaN | NaN | NaN |
| 56543 | 56543 | 2 | 2024-01-31 19:23:00 | 2024-01-31 19:31:00 | NaN | NaN | 166 | 151 | NaN | 1.09 | 11.74 | 0.0 | 0.0 | 2.00 | 0.0 | NaN | 1.0 | 14.74 | NaN | NaN | NaN |
| 56544 | 56544 | 2 | 2024-01-31 19:14:00 | 2024-01-31 19:23:00 | NaN | NaN | 193 | 146 | NaN | 1.52 | 11.38 | 0.0 | 0.0 | 1.24 | 0.0 | NaN | 1.0 | 13.62 | NaN | NaN | NaN |
| 56545 | 56545 | 2 | 2024-01-31 19:41:00 | 2024-01-31 19:57:00 | NaN | NaN | 41 | 237 | NaN | 2.75 | 16.39 | 0.0 | 0.0 | 1.01 | 0.0 | NaN | 1.0 | 21.15 | NaN | NaN | NaN |
| 56546 | 56546 | 2 | 2024-01-31 20:46:00 | 2024-01-31 20:55:00 | NaN | NaN | 33 | 25 | NaN | 0.00 | 11.58 | 0.0 | 0.0 | 3.14 | 0.0 | NaN | 1.0 | 15.72 | NaN | NaN | NaN |
| 56547 | 56547 | 2 | 2024-01-31 21:06:00 | 2024-01-31 21:11:00 | NaN | NaN | 72 | 72 | NaN | 0.49 | 11.58 | 0.0 | 0.0 | 0.00 | 0.0 | NaN | 1.0 | 12.58 | NaN | NaN | NaN |
| 56548 | 56548 | 2 | 2024-01-31 21:36:00 | 2024-01-31 21:40:00 | NaN | NaN | 72 | 72 | NaN | 0.52 | 11.58 | 0.0 | 0.0 | 2.52 | 0.0 | NaN | 1.0 | 15.10 | NaN | NaN | NaN |
| 56549 | 56549 | 2 | 2024-01-31 22:45:00 | 2024-01-31 22:51:00 | NaN | NaN | 41 | 42 | NaN | 1.17 | 14.22 | 0.0 | 0.0 | 0.00 | 0.0 | NaN | 1.0 | 15.22 | NaN | NaN | NaN |
| 56550 | 56550 | 2 | 2024-01-31 22:28:00 | 2024-01-31 22:59:00 | NaN | NaN | 33 | 91 | NaN | 9.27 | 44.62 | 0.0 | 0.0 | 4.56 | 0.0 | NaN | 1.0 | 50.18 | NaN | NaN | NaN |